Harvesting Indices to Grow a Controlled Vocabulary: Towards Improved Access to Historical Legal Texts
Abstract
We describe ongoing work that aims to derive a multilingual controlled vocabulary (German, French, Italian) from the combined subject indices of 22 volumes of a large-scale critical edition of historical documents. The controlled vocabulary is intended to help editors assign descriptors to new documents and to help users retrieve documents of interest regardless of the spelling or language variety used in the documents.
Similar resources
Porting Elements of the Austrian Baroque Corpus onto the Linguistic Linked Open Data Format
We describe work on porting linguistic and semantic annotation applied to the Austrian Baroque Corpus (ABaC:us) to a format supporting its publication in the Linked Open Data Framework. This work includes several aspects, such as a derived lexicon of old forms used in the texts and their mapping to modern German lemmas, the description of morphosyntactic features and the building of domain-specific...
The effects of captioning texts and caption ordering on L2 listening comprehension and vocabulary learning
This study investigated the effects of captioned texts on second/foreign (L2) listening comprehension and vocabulary gains using a computer multimedia program. Additionally, it explored the caption ordering effect (i.e. captions displayed during the first or second listening), and the interaction of captioning order with the L2 proficiency level of language learners in listening comprehension a...
Reduplication in Persian Language and Literature
Reduplications are formed by repeating part of the base. The repeated part has no meaning of its own, is never used alone, and is common mainly in spoken language. In recent times, reduplications have been used in some texts of poetry and prose, in particular in stories written in the vernacular. This research, with a historical approach and an analytical-explanatory method, examines the information...
The Effect of “Narrow Reading” on Learning Mid-Frequency Vocabulary: The Role of Genre and Author
This study investigated the effect of Narrow Reading (NR) on learning mid-frequency words. The Vocabulary Size Test (VST) designed by Nation and Beglar (2007) was administered as the first pre-test to 196 students, from among whom 91 students whose vocabulary size ranged between 2100 and 3500 word families became the target of this study and were randomly c...
Quantifying Text Difficulty with Automated Indices of Cohesion and Semantics
We evaluated the effectiveness of new indices of text comprehension in measuring relative text difficulty. Specifically, we examined the efficacy of automated indices produced by the web-based computational tool Coh-Metrix. In an analysis of 60 instructional science texts, we divided texts into groups that were considered to be more or less difficult to comprehend. The defining criteria were ba...